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Abstract — The capacity of ideal MIMO channels has an high- 
SNR slope that equals the minimum of the number of transmit 
and receive antennas. This letter shows that physical MIMO 
channels behave fundamentally different, due to distortions from 
transceiver impairments. The capacity has a finite upper limit 
that holds for any channel distribution and SNR. The high-SNR 
slope is thus zero, but the relative capacity gain of employing 
multiple antennas is at least as large as for ideal transceivers. 

Index Terms — Channel capacity, high-SNR analysis, multi- 
antenna communication, transceiver impairments. 

I. Introduction 

In the past decade, a vast number of papers have stud- 
ied multiple-input multiple-output (MIMO) communications 
motivated by the impressive capacity scaling in the high- 
SNR regime. The seminal article (TJ by E. Telatar shows 
that the MIMO capacity with channel knowledge at the 
receiver behaves as mm(N t , N r ) log 2 (P) + 0(1), where N t 
and N r are the total number of transmit and receive antennas, 
respectively, and P is the signal-to-noise ratio (SNR). The 
factor min(iV t , N r ) is the asymptotic gain over single-antenna 
channels and is called degrees of freedom or multiplexing gain. 

Some skepticism concerning the applicability of these re- 
sults in cellular networks has recently appeared; modest gains 
of network MIMO over conventional schemes have been 
observed and the throughput might even decrease due to the 
extra overhead (2), (3). One explanation is the finite channel 
coherence time that limits the resources for channel acquisition 
[4 1 and coordination between nodes (3), thus creating a finite 
fundamental ceiling for the network spectral efficiency — 
irrespectively of the power and the number of antennas. 

While these results concern large network MIMO systems, 
there is another non-ideality that also affects performance and 
manifests itself for MIMO systems of any size: transceiver im- 
pairments [ 5 1 — [ 1 1 . Physical radio-frequency (RF) transceivers 
suffer from amplifier non-linearities, IQ-imbalance, phase 
noise, quantization noise, carrier-frequency and sampling-rate 
jitter/offsets, etc. These impairments are conventionally over- 
looked in information theoretic studies, but this letter shows 
that they have a non-negligible and fundamental impact on the 
spectral efficiency in modern deployments with high SNR. 

This letter analyzes the capacity of a generalized MIMO 
channel with transceiver impairments. We generalize results 
from (5J and (7) and emphasize the high-SNR behavior, since 
the capacity has a finite limit and thus is fundamentally 
different from the ideal case in (TJ. The main conclusion 
is that the classic multiplexing gain is zero, but the relative 
improvement over single-antenna channels can be even larger 
for these physical MIMO channels than with ideal transceivers. 
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Fig. 1. Block diagram of the generalized MIMO channel considered in 
this letter. Unlink the classical channel model in [1 1, the transmitter-distortion 
generated by physical transceiver implementations is included in the model. 

II. Generalized Channel Model 

Consider a flat-fading MIMO channel with N t transmit an- 
tennas and N r receive antennas. The received signal y € 
in the classical affine baseband channel model of (TJ is 

y = v / PHx + n, (1) 

where P is the SNR, x E C Nt is the intended signal, 
and n ~ CAf(0, 1) is circular-symmetric complex Gaussian 
noise. The channel matrix H 6 C N r><Nt j s assumed to be 
a random variable H having any multi-variate distribution /m 
with the normalized gain E{tr(H ff H)} = N t N r and full-rank 
realizations (i.e., rank(H) = mm(N t , N r )) almost surely — 
this basically covers all physical channel distributions. 

The intended signal in (TJ is only affected by a multi- 
plicative channel transformation and additive thermal noise, 
thus ideal transceiver hardware is implicitly assumed. Physical 
transceivers suffer from a variety of impairments that are 
not properly described by ([TJ (3J-|[T0|. The influence of 
impairments is reduced by compensation schemes, leaving a 
residual distortion that is well-modeled as additive Gaussian 
noise on the baseband with a variance that scales with P (TJ. 

Building on the analysis and measurements in [6|-|8|, the 
combined influence of impairments in the transmitter hardware 
is modeled by the generalized MIMO channel 

y = x/PH(x + »7 t ) + n, (2) 

where the transmitter-distortion T] f G C Nt describes the (resid- 
ual) impairments. This term describes the mismatch between 
the intended signal x and the signal actually generated by the 
transmitter; see the block diagram in Fig. ffl 

Under the normalized power constraint] tr(Q) = 1 with 
Q = E{xx H } (similar to Q), the transmitter-distortion is 

r7 t ~CAA(0,Y t (Q)) with T t = diag(«i(Q), . . . ,v Nt (Q)). 

The distortion depends on the intended signal x in the sense 
that the variance t'„(Q) is an increasing function of the signal 
power q n at the nth transmit antenna (i.e., the nth diago- 
nal element of Q). We neglect any cross-correlation in T t 

'The power constraint is only defined on the intended signal, although 
distortions also contribute a small amount of power. However, this extra power 
is fully characterized by the SNR, P, and we therefore assume that P is 
selected to make the total power usage fulfill all external system constraints. 
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between antennas]^ In multi-carrier (e.g., OFDM) scenarios, 
the channel model (2} can describe each individual subcarrier. 
However, there is some distortion-leakage between subcarriers 
and for simplicity we model this on a subcarrier basis as 
leakage between the antennas (the impact of what is done on 
the different antenna at one subcarrier is likely to average out 
when having many subcarriers). To capture a range of cases 
we propose 

W „(Q), K 2 ((l-a) ? „l^) (3) 

where the parameter a € [0, 1] enables transition from one 
(a = 0) to many (a = 1) subcarriers. The parameter k > is the 
level of impairments jj This model is a good characterization of 
phase noise and IQ-imbalance, while amplifier non-linearities 
(that make v n (Q) increase with P) are neglected [6] — the 
dynamic range is assumed to always match the output power. 

III. Analysis of Channel Capacity 

The transmitter knows the channel distribution /h, while 
the receiver knows the realization H. The capacity of Q is 

C Nt ,N r (P)= sup X(x;y,H) (4) 

/ x :tr(E{xx«})=tr(Q)=l 

where fx is the PDF of x and !(•,-;•) is conditional mutual 
information. Note that I(x;y,H) = E H {I(x;y|H = H)}. 

Lemma 1. The capacity CN t ,N r (P) can be expressed as 



N t > N r . Informally speaking, the lower and upper bounds 
are tight when the high-SNR capacity-achieving Q is isotropic 
in a subspace of size N t and size mm(N r , N t ), respectively. 
The following corollaries exemplify these extremes. 

Corollary 1. Suppose the channel distribution is right- 
rotationally invariant (e.g., H ~ HU for any unitary matrix 
U). The capacity is achieved by Q = -^T for any P and a. 
The lower bound in |6]l is asymptotically tight for any Nt. 

Proof: The right-rotational invariance implies that the 
N dimensions of H H H are isotropically distributed, thus 
the concavity of E{logdet(-)} makes an isotropic covariance 
matrix optimal. The lower bound in |6]l is asymptotically tight 
as it is constructed using this isotropic covariance matrix. ■ 

This corollary covers Rayleigh fading channels that are 
uncorrelated at the transmit side, but also other channel 
distributions with isotropic spatial directivity at the transmitter. 

The special case of a deterministic channel matrix enables 
stronger adaptivity of Q and achieves the upper bound in ([6j. 

Corollary 2. Suppose a = 1 and the channel H is determinis- 
tic and full rank. Let H H H = UmA^/U^ denote a compact 
eigendecomposition where Am = diag(Ai, . . . , Am ) contains 
the non-zero eigenvalues and the semi-unitary XJm £ C WtXM 
contains the corresponding eigenvectors. The capacity is 



■mma i. ine capacity ^N t ,N T K r ) can De expressed as rj N N (p) = ^ log 2 ^1 + ' j 

■tr(Q)=i' H ^ ^ 2 < ^ e *("''^^ :> ^'"^^ (PHT f H +1) )| (5) fg r = \jx — where /j, is selected to make 2i=i 



(7) 



Q:tr(Q> 

and is achieved by x ~ CJV(0, Q) for some feasible Q y 0. 

Proof: For any realization H = H and fixed P, |2]) 
is a classical MIMO channel but with the noise covariance 
(PHT ( H H + 1). Eq. |5]l and the sufficiency of using a 
Gaussian distribution on x then follows from (TJ. ■ 
Although the capacity expression in <|3j appears similar to 
that of the classical MIMO channel in ([TJ and (TJ, it behaves 
very differently — particularly in the high-SNR regime. 

Theorem 1. The asymptotic capacity limit CN t ,N r {°°) = 
limp^oo CN t ,N r {P) is finite and bounded as 



M log 2 1 1+ -a I <C NuNr (oo) < M log 2 ( 1 + 



Mk : 



(6) 



where M = mm(N t , N r ). The lower bound is asymptotically 
achieved by Q = ^-1. The two bounds coincide if Nt < N r . 

Proof: The proof is given in the appendix. ■ 
This theorem shows that physical MIMO systems have a 
finite capacity limit in the high-SNR regime — this is funda- 
mentally different from the unbounded asymptotic capacity 
for ideal transceivers |JJ. Furthermore, the bounds in |6]l hold 
for any channel distribution and are only characterized by the 
number of antennas and the level of impairments k. 

The bounds in |6]l coincide for Nt < N r , while only the 
upper bound grows with the number of transmit antennas when 

2 The paper J8J predicts such a correlation, but it is typically small. 

3 The error vector magnitude, EVM = SHr^rtj , is a common measure 

^{l[x|l / 

for quantifying RF transceiver impairments. Observe that the EVM equals 
re 2 for the considered u n (Q) in j5J. EVM requirements in the range ft G 
[0.08,0.175] occur in Long Term Evolution (LTE) |9| Section 14.3.4]. 



The capacity is achieved by Q = Uj\/diag(<ii, . . . , c?m)Um. 
The upper bound in |6]l is asymptotically tight for any Nt. 

Proof: The capacity-achieving Q is derived as in (JJ, 
using the Hadamard inequality. The capacity limit follows as 
Q = jjXJm^Jm achieves the upper bound. ■ 

Although the capacity behaves differently under impair- 
ments, the optimal waterfilling power allocation in Corollary [2] 
is the same as for ideal transceivers (also noted in (7)). When 
N t > N r , the capacity limit Aflog 2 (l + jg^r) is improved 
by increasing Nt, because a deterministic H enables selective 
transmission in the A^ r non-zero channel dimensions while the 
transmitter-distortion is isotropic over all Nt dimensions. 

We conclude the analysis by elaborating on the fact that the 
lower bound in |6]l is always asymptotically achievable. 

Corollary 3. If the channel distribution /g is unknown 
at the transmitter, the worst-case mutual information 
min/ H I(x; y, H) is maximized by Q = ^1 (for any a) and 
approaches M log 2 (l + -4) as P— >oo. 

A. Numerical Illustrations 

Consider a channel with N t = N r = A and varying SNR. 
Fig. [2] shows the average capacity over different determinis- 
tic channels, either generated synthetically with independent 
C7V(0, l)-entries or taken from the channel measurements in 
(TTJ. The level of impairments is varied as k € {0.05, 0.1}. 

Ideal and physical transceivers behave similarly at low and 
medium SNRs in Fig. [2] but fundamentally different at high 
SNRs. While the ideal capacity grows unboundedly, the capac- 
ity with impairments approaches the capacity limit (74.4(00) = 
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Fig. 2. Average capacity of a 4x4 MIMO channel over different deterministic 
channel realizations and different levels of transceiver impairments. 

41og 2 (l + ^j) in Theorem [T] The difference between the 
uncorrelated synthetic channels and the realistically correlated 
measured channels vanishes asymptotically. Therefore, only 
the level of impairments, k, decides the capacity limit. 

Next, we illustrate the case N t > N r and different a. Fig. [3] 
considers N t <E {4, 12}, while having N r = 4, n = 0.05, and 
two different channel distributions: deterministic (we average 
over independent CAf(0, 1) -entries) and uncorrelated Rayleigh 
fading. We show a € {0, 1} in the deterministic case, while 
the random case gives Q = ^ T and same capacity for any a. 

These channels perform similarly and have the same capac- 
ity limit when N t = 4. The convergence to the capacity limit 
is improved for the random distribution when Nt increases, 
but the value of the limit is unchanged. Contrary, the capacity 
limits in the deterministic cases increase with Nt (and with 
a since it makes the distortion more isotropic). Fig. [5] shows 
that there is a medium SNR range where the capacity exhibits 
roughly the same A/-slope as achieved asymptotically for 
ideal transceivers. Following the terminology of [3], this is the 
degrees-of-freedom (DoF) regime while the high-SNR regime 
is the saturation regime; see Fig. [3] This behavior appeared 
in (3) for large cellular networks due to limited coherence 
time, but we demonstrate its existence for any physical MIMO 
channel (regardless of size) due to transceiver impairments. 

IV. Gain of Multiplexing 

The MIMO capacity with ideal transceivers behaves as 
M \og 2 (P) + 0(l) 11), thus it grows unboundedly in the high- 
SNR regime and scales linearly with the so-called multiplexing 
gain M = min(N t , N r ). On the contrary, Theorem [T] shows 
that the capacity of physical MIMO channels has a finite upper 
bound, giving a very different multiplexing gain: 

CN t ,N r {P) 



M 



classic 



= lim 

P^oc log 2 (P) 



= 0. 



(8) 



In view of (MJ, one might think that the existence of a non- 
zero multiplexing gain is merely an artifact of ignoring the 
transceiver impairments that always appear in practice. How- 
ever, the problem lies in the classical definition, because also 
physical systems can gain in capacity from employing multiple 
antennas and utilizing spatial multiplexing. A practically more 
relevant measure is the relative capacity improvement (at a 
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Fig. 3. Capacity of a MIMO channel with N r = 4 and impairments with 
K = 0.05. We consider different Nt, channel distributions, and a-values. 

finite P) of an Nt x N r MIMO channel over the corresponding 
single-input single-output (SISO) channel. 

Definition 1. The finite -SNR multiplexing gain, A4(P), is the 
ratio of MIMO to SISO capacity at a given P. For |2) we get 

C Nt ,N r (P) 



M{P) 



(9) 



CiAP) ' 

This ratio between the MIMO and SISO capacity quantifies 
the exact gain of multiplexing. The concept of a finite-SNR 
multiplexing gain was introduced in p2) for ideal transceivers, 
while the refined Definition [T] can be applied to any channel 
model. The asymptotic behavior of M(P) is as follows. 

Theorem 2. Let h denote the SISO channel. The finite-SNR 
multiplexing gain, Ai(P), for |2]) and any a satisfies 



E{||H||3} 

N t E{\h\ 2 } 



< 



lim M(P) < 

P-s-0 



M < lim M(P) < M 



E{||H||1} 
log 2 (l + 



N t 
Mk' 



log 2 (l + i) 



(10) 



(11) 



where || • ||f and || • || 2 denote the Frobenius and spectral norm, 
respectively. The upper bounds are achieved for deterministic 
channels (with full rank and a = 1). The lower bounds are 
achieved for right-rotationally invariant channel distributions. 

Proof: The low-SNR behavior is achieved by Taylor 



approximation: Q 



I gives the lower bound, while the 



per-realization-optimal Q = uu H (where u is the dominating 
eigenvector of H ff H) gives the upper bound. The high-SNR 
behavior follows from Theorem Q] and its corollaries. ■ 
This theorem indicates that transceiver impairments have 
little impact on the relative MIMO gain, which is a very 
positive result for practical applications. The low-SNR be- 
havior in (JTUJ) is the same as for ideal transceivers (since 
PHT t H fl + I « I), while {□} shows that physical MIMO 
channels can achieve A4(P) > M in the high-SNR regime 
(although ideal transceivers only can achieve A4(P) = M). 

A. Numerical Illustrations 

The finite-SNR multiplexing gain is shown in Figs. [4] and [5] 
for uncorrelated Rayleigh fading and deterministic channels, 
respectively, with N t e {4, 8, 12}, N r = 4, n = 0.05, a = 1. 

The limits in Theorem [2] are confirmed by the simulations. 
Although the capacity behavior is fundamentally different for 
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Fig. 4. Finite-SNR multiplexing gain for an uncorrelated Rayleigh fading 
channel with N r = 4 and N* > 4. 
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Fig. 5. Average finite-SNR multiplexing gain of deterministic channels 
(generated with independent CAf(0, l)-entries) with N r = 4 and Nt > 4. 

physical and ideal transceivers, the finite-SNR multiplexing 
gain is remarkably similar — not unexpected since the asymp- 
totic limits in Theorem [2] are almost the same for any level 
of transceiver impairments. The main difference is in the 
high-SNR regime, where (a) there is a faster convergence to 
the limits under impairments and (b) deterministic channels 
achieve an asymptotic gain higher than M when N t > N r . 

V. Concluding Implications 

Unlike conventional capacity analysis, the capacity of physi- 
cal MIMO systems saturates in the high-SNR regime (see The- 
orem [T} and the finite capacity limit is channel independent. 
This fundamental result is explained by the distortion from 
transceiver impairments and that its power is proportional to 
the signal power. The classic multiplexing gain is thus zero 
(see Eq. |8]l). Nevertheless, the MIMO capacity grows roughly 
linearly with M = imn(N t ,N r ) (see Theorem |2|i over the 
whole SNR range, thus showing that also physical systems 
can achieve great gains from spatial multiplexing. 

Technological advances can reduce transceiver impairments, 
but there is currently an opposite trend towards small low-cost 
low-power transceivers where the inherent dirty RF effects are 
inevitable and the transmission is instead adapted to them. 

The point-to-point MIMO capacity limit in Theorem [T] is 
also an upper bound for scenarios with extra constraints; for 
example, network MIMO which is characterized by distributed 
power constraints and limited coordination both between trans- 
mit antennas and between receive antennas. The capacity in 
such scenarios therefore saturates in the high-SNR regime — 
even in small networks where the analysis in (3) is not ap- 
plicable. Note that the finite-SNR multiplexing gain decreases 
when adding extra constraints JT0} and that impairments limit 
the asymptotic accuracy of channel acquisition schemes. 



Appendix: Proof of Theorem Q] 

As a preliminary, consider any full-rank channel realization 
H. Let H H H = UmAmUj/ denote a compact eigendecom- 
position (with U M € C NiXM , A M € C MxM ; see Corollary 
[2}. The mutual information is increasing in P and satisfies 

log 2 det(l+PHQH ff (PHTtH^+I)" 1 ) 
= log 2 dct(l+FUf f (Q + T t )U M A M ) 

- log 2 det(l+PUf f T f U M A M ) -> 
log 2 det(U^(Q+Y t )U M A M ) - l 0g2 det(U^T t U M A M ) 
= log 2 det(l + U^QUm(U^TlWm)- 1 ) (12) 

as P — > 00. The first equality follows from expanding the loga- 
rithm and using the rule det(I+AB) — det(I+BA). This en- 
ables taking the limit P — > 00 and achieve an expression where 
the impact of Am cancels out. We then identify the projec- 
tion matrix n r f/ 2 u M - T? /2 \J M {^ M r t \J m)- 1 ^ T t 1/2 
onto U^Y^ 2 . The ith strongest eigenvalue is denoted fii(-). 

The convergence as P —> 00 is uniform, thus we can achieve 
bounds by showing that all realizations has the same asymp- 
totic bound. A lower bound is given by any feasible Q; we 
select Q = -^-1 as it gives Y t = ffl and makes ( |T2] > indepen- 
dent of H. Since ( p"3j ) is a Schur-concave function in the eigen- 
values, an upper bound is achieved by replacing /Xi(-) with 



= log 2 det(l + Y~ 1/2 QY 

M 

= ^log 2 (l+^(Y t - 1/2 QY7 H/2 n T f/ 2 u M ) 



the average eigenvalue -^tr(Y f 1/2 QY t H ^ 2 Ilx" /2 u M ) < 
^tr(Y~ 1/2 QY~ H/2 ) = where the inequality follows 

from removing the projection matrix (since n-r? /2 u M di I)- 
Note that the upper and lower bounds coincide when Nt < N r , 
thus Q = jjl is asymptotically optimal in this case. 
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